Pre-distortion apparatus

ABSTRACT

Pre-distortion apparatuses and methods for a non-linear component are provided. The apparatus comprises an adaptive block for generating a plurality of correlation coefficients, which are used to weight a plurality of synthesis work functions to pre-distort a given signal. The adaptive block can be driven by an error signal generated from a feedback signal from the non-linear component output signal and a delayed version of the input signal. The apparatus is capable of being operated directly at radio frequency. Also provided are apparatuses and methods for generation of quadrature signals, transconductance amplification employing negative resistance, variable-gain amplification, and envelope detection.

FIELD OF THE INVENTION

This invention relates to a pre-distortion apparatus for non-linear components, which can be used as a linearizer for a radio-frequency (RF) power amplifier (PA), as well as various component circuitry and methods for implementing said pre-distortion apparatus.

BACKGROUND OF THE INVENTION

A communication system typically comprises multiple signaling nodes, such as user terminals, base stations, routers, switches, links, and so on. The nodes transmit and/or receive signals over a communication medium such as copper wire, optical fiber, or the atmosphere in the case of a radio interface.

In general, the signaling function requires some sort of signal amplification, since the amplitude of a signal is generally attenuated during transmission between nodes. For example, signals transmitted over a radio link may be attenuated due to such factors as propagation loss and multipath fading. A signal amplifier is thus typically provided to compensate for the attenuation.

In particular, a power amplifier (PA) is used to amplify a signal before transmission over a radio interface. When operated near saturation, PAs behave nonlinearly, leading to unwanted distortion of the signal. Such distortion can include so-called amplitude-to-amplitude (AM-AM) distortion and amplitude-to-phase (AM-PM) distortion.

To suppress unwanted PA nonlinearity, techniques such as using a pre-distorter have been investigated. A pre-distorter, disposed before a PA in the signal path, acts on an input signal in such a way that the combined effect of the pre-distorter and the PA is linear and memoryless. The advantages of using a pre-distorter include reducing spurious emissions, as well as improving power efficiency and in-band signal processing accuracy.

Various pre-distortion techniques have been described in the prior art. Look-up table based digital pre-distortion entails measuring the non-linear characteristics of a PA and storing a “mirror image” of those characteristics in a look-up table. Alternatively, such “mirror image” characteristics may be pre-programmed into pre-distortion components operating directly at RF in a technique called “analog feed-forward.” Yet another pre-distortion technique is polynomial-based digital pre-distortion, which entails digitally pre-distorting a signal at baseband using polynomial basis functions. With appropriate feedback, time-varying PA characteristics can be optimally adjusted using the latter approach.

The present disclosure describes various novel apparatuses and methods for linearizing non-linear output signals that may be used either in conjunction with or to the exclusion of the prior art techniques described above.

SUMMARY OF THE INVENTION

The present disclosure describes novel apparatuses and methods for linearizing the output signal of non-linear components such as RF power amplifiers, as well as various component circuitry for implementing said apparatuses and methods.

One aspect of the invention provides a pre-distortion apparatus comprising: a datapath signal, a reference signal, and a feedback signal; an error signal generator comprising a difference amplifier, wherein the input signals to said difference amplifier comprise: 1) a first amplifier input signal derived from the reference signal, and 2) a second amplifier input signal derived from the feedback signal, and wherein the output signal of said amplifier comprises an error signal; an adaptive block comprising: an analysis basis function generator for generating a plurality of analysis basis functions; a plurality of correlators for correlating the error signal with each of said plurality of analysis basis functions, the output signals of the plurality of correlators comprising a plurality of correlation coefficients; a synthesis block comprising: a synthesis work function generator for generating a plurality of synthesis work functions; a synthesizer for generating a weighted sum of said plurality of synthesis work functions, wherein each synthesis work function is weighted by a corresponding one of said plurality of correlation coefficients; a multiplier for multiplying said datapath signal with said weighted sum of said plurality of synthesis work functions. Also provided are various methods and means for achieving said pre-distortion.

A further aspect of the invention provides an apparatus for generating a first differential signal having a quadrature-phase relationship with a second differential signal comprising: a differential gyrator means having a first and second port for inputting said first differential signal, and a third and fourth port for outputting said second differential signal; and a coupling means for coupling the third port of the differential gyrator to the fourth port of the differential gyrator means. Also provided are various means and methods for generating said quadrature-phase signals.

Yet a further aspect of the invention provides a transconductance amplifier comprising: a current source generating a current at a current terminal; a differential pair comprising two transistors, each transistor having a source terminal connected to the current terminal; a load device connected to the drain terminal of each transistor in said differential pair; a negative resistance block coupled in parallel with the drain terminals of the transistors in said differential pair. Also provided are various means and methods for such transconductance amplification.

Yet a further aspect of the invention provides an apparatus for generating a first signal having a quadrature-phase relationship with a second signal, said apparatus comprising: a differential reference signal comprising a first single-ended input signal and a second single-ended input signal, wherein said first single-ended input signal is substantially 180 degrees out of phase with second single-ended input signal; a first square-root function block for generating an output signal proportional to the square root of the first single-ended input signal; a second square root function block for generating an output signal proportional to the square root of the second single-ended input signal; wherein said first signal comprises the output signal of said first square root function block and said second signal comprises the output signal of said second square root function block. Also provided are various means and methods for generating said quadrature-phase signals.

Yet a further aspect of the invention provides an amplifier for providing a variable gain to an input signal, said amplifier comprising: a first transconductor having a differential input and a differential output, and a variable transconductance; a second transconductor having a differential input and a differential output, and a variable transconductance; a third transconductor having a differential input and a differential output, and a variable transconductance; a fourth transconductor having a differential input and a differential output, and a variable transconductance; a first coupling capacitance between the nodes of said differential input of said second transconductor; a second coupling capacitance between the nodes of said differential input of said third transconductor; wherein: the differential output of said first transconductor is coupled to the differential input of said second transconductor; the differential output of said second transconductor is coupled to the differential input of said third transconductor; the differential output of said third transconductor is coupled to the differential input of said first transconductor; the differential output of said third transconductor is coupled to the differential input of said third transconductor; the differential output of said fourth transconductor is coupled to the differential input of said third transconductor; the differential input of said fourth transconductor comprises said input signal; and the differential output of said third transconductor comprises an output signal. Also provided are various means and methods for providing a variable gain to an input signal.

Yet a further aspect of the invention provides an apparatus for detecting the envelope of a signal comprising: a first transistor, wherein the gate terminal of said first transistor is coupled to said signal; a capacitor having a first terminal coupled to the source terminal of said first transistor, and a second terminal coupled to a ground voltage; a second transistor, wherein the drain terminal of said second transistor is coupled to said first terminal of said capacitor, and the gate terminal of said second transistor is coupled to a control voltage; wherein the detected envelope of said signal comprises the voltage across said capacitor. Also provided are various means and methods for envelope detection.

BRIEF DESCRIPTION OF FIGURES

FIG. 1 shows a specific embodiment of the pre-distorter in a power amplifier in a radio transmitter.

FIG. 2 shows a power coupler for use with a memory compensator aspect of the pre-distorter.

FIG. 3 shows an overview of the internal system architecture of an embodiment of the RFPAL 101 shown in FIG. 1.

FIG. 4 shows the portion of the RFPAL 101 corresponding to the pre-distortion block 302 and error signal generator block 303 shown in FIG. 3.

FIGS. 5, 5A, 5C, and 5D show preferred embodiments of the envelope detectors 408 and 413 shown in FIG. 4. FIG. 5B shows an implementation of the square root generator block shown in FIG. 5A.

FIG. 6A shows an RC-CR implementation of the quadrature phase generator.

FIG. 6B shows a phase-shifter implemented using a Hilbert transformer.

FIG. 6C shows a quadrature phase generator implemented using an active LC network circuit.

FIG. 6D shows a modified active LC circuit wherein the capacitance C is adjustable by configuring a set of switches connected to a series of capacitors 630.

FIG. 6E shows an embodiment of the input stage block 601 in FIG. 6C.

FIG. 6F shows an implementation of one of the transconductors G1 or G2 in the differential gyrator 604 shown in FIG. 6C.

FIG. 6G shows a modified version of the transconductor circuit shown in FIG. 6F.

FIG. 6H shows yet another possible embodiment of a quadrature-phase generator known as an injection-locked quadrature generator.

FIG. 7 shows an implementation of the Q polynomial function synthesizer 402.2 in FIG. 4.

FIG. 8 shows a preferred implementation of the RF variable-gain amplifiers (VGA) 405.1 and 405.2 in FIG. 4.

FIG. 8A shows an alternative capacitor arrangement for one of the transconductors in the VGA shown in FIG. 8.

FIG. 8B shows a circuit implementation of the transconductors G1, G2, G3, and G4.

FIG. 8C shows an alternative circuit implementation of the transconductors G1, G2, G3, and G4, utilizing both NMOS and PMOS transistors.

FIG. 9 shows an implementation of the error signal generator block 303 shown in FIG. 4.

FIG. 10 shows an implementation of the Adapt P block 403.1.

FIG. 10A shows some of the functionality of a microprocessor used in the pre-distorter.

FIG. 11 shows one possible architecture of the work function generator 1006 in FIG. 10.

FIG. 11A shows an alternative implementation of the work function generator to decrease the number of adders and multipliers from the architecture shown in FIG. 11.

FIG. 12 shows a preferred set of weights w for each polynomial analysis work function Φ_(i), according to the notation defined in FIG. 11.

FIG. 13 shows a preferred embodiment of a low-pass filter for use in the Adapt P block shown in FIG. 10.

FIG. 14 shows the linear transformations that can be performed by the microprocessor 1010 shown in FIG. 10A.

FIG. 15 shows an embodiment of a memory compensator that operates on two signals 1501 and 1502.

FIG. 16A shows an embodiment of the pre-distorter in the RF front-end of a radio receiver.

FIG. 16B shows an embodiment of the pre-distorter in a high-speed analog-to-digital converter (ADC).

DETAILED DESCRIPTION

In this specification and in the claims, it will be understood that when an element is referred to as being “connected to” or “coupled to” another element, it can be directly connected or coupled to the other element or intervening elements may be present. In contrast, when an element is referred to as being “directly connected to” or “directly coupled to” another element, there are no intervening elements present.

Denote the input signal to a non-linear component (NLC) by a signal s(t), which can be expressed as: s(t)=r(t) cos (ω_(c) t+p(t)),

where r(t) represents the time-dependent amplitude (whose absolute value corresponds to the envelope) of the input signal, ω_(c) is the carrier frequency in radians per second, and p(t) represents a time-dependent phase term. In the absence of a pre-distorter, the NLC will generally introduce AM-AM (amplitude-to-amplitude) and AM-PM (amplitude-to-phase) non-linear distortion to this signal as follows: NLC(s(t))=G[r(t)] cos (ω_(c) t+p(t)+B[r(t)]),

where G represents the AM-AM distortion, and B represents the AM-PM distortion.
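
The distortion model above can be illustrated with a minimal Python sketch. The function names and the Saleh-style curves used for G and B are assumptions chosen purely for illustration; the specification does not prescribe any particular form for G or B.

```python
import math

def nlc_output(r, p, wc_t, gain_fn, phase_fn):
    """Evaluate NLC(s(t)) = G[r] cos(wc*t + p + B[r]) at one time instant.

    r      : instantaneous amplitude r(t)
    p      : instantaneous phase term p(t), in radians
    wc_t   : carrier phase wc*t, in radians
    gain_fn, phase_fn : callables standing in for G and B
    """
    return gain_fn(r) * math.cos(wc_t + p + phase_fn(r))

# Hypothetical Saleh-style curves (assumed, not taken from the specification).
def saleh_gain(r, alpha=2.0, beta=1.0):
    return alpha * r / (1.0 + beta * r * r)          # AM-AM: G[r]

def saleh_phase(r, alpha=math.pi / 3, beta=1.0):
    return alpha * r * r / (1.0 + beta * r * r)      # AM-PM: B[r], radians

# At small amplitudes the gain is nearly linear; near saturation it compresses.
small_signal_gain = saleh_gain(0.01) / 0.01
compressed_gain = saleh_gain(1.0) / 1.0
```

With these assumed curves the small-signal gain is close to 2 while the gain at r = 1 has dropped to 1, which is the kind of compression a pre-distorter is meant to counteract.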

To correct this non-linearity, the signal s(t) can be first processed to generate a pre-distorted signal y(t) given by:

$y(t) = \sum_{i=1}^{N} p_{i}\, r^{i} \cos\left(\omega_{c} t + p(t)\right) + \sum_{i=1}^{N} q_{i}\, r^{i} \sin\left(\omega_{c} t + p(t)\right),$

where p_(i) and q_(i) represent coefficients for weighting each basis function r^(i) cos (ω_(c)t+p(t)) and r^(i) sin (ω_(c)t+p(t)), respectively. (Note that for simplicity of notation, the time dependence of r has been omitted from the preceding equation.) The pre-distorter output signal y(t) can then be input to the NLC to produce: NLC(y(t))=G′[r(t)] cos (ω_(c) t+p(t)+B′[r(t)])  (Eq. 1)

where G′ and B′ represent the composite AM-AM and AM-PM distortion, respectively, of the combination of the pre-distorter and NLC. In designing a pre-distorter, then, it is seen that the coefficients p_(i) and q_(i) should be chosen such that the composite functions G′ and B′ introduce as little non-linear distortion as possible to the signal s(t).
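
As a rough illustration of the pre-distorted signal y(t), the following Python sketch evaluates the two weighted sums at a single time instant, using r^i in both sums as in the basis functions named above. The function names and the list layout of the coefficients are assumptions made for this example only.

```python
import math

def _poly(r, coeffs):
    """Return sum_i coeffs[i-1] * r**i for i = 1..N (no constant term)."""
    return sum(c * r ** (i + 1) for i, c in enumerate(coeffs))

def predistort(r, theta, p, q):
    """y = sum_i p_i r^i cos(theta) + sum_i q_i r^i sin(theta).

    theta stands for the total carrier phase wc*t + p(t), in radians.
    """
    return _poly(r, p) * math.cos(theta) + _poly(r, q) * math.sin(theta)

def amplitude_phase(r, p, q):
    """Equivalent amplitude A and phase shift phi with y = A cos(theta - phi)."""
    in_phase, quadrature = _poly(r, p), _poly(r, q)
    return math.hypot(in_phase, quadrature), math.atan2(quadrature, in_phase)

# Example with N = 3 and arbitrary illustrative coefficients.
p = [1.0, 0.0, 0.1]
q = [0.0, 0.0, -0.05]
y = predistort(0.8, 0.3, p, q)
A, phi = amplitude_phase(0.8, p, q)
```

The second helper shows why the two sums suffice: the in-phase sum and the quadrature sum together set both the amplitude and the phase offset applied to the carrier, which is what is needed to counteract AM-AM and AM-PM distortion.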

Turning now to a specific embodiment, FIG. 1 shows a pre-distorter for a power amplifier in a radio transmitter. One of ordinary skill in the art will recognize that the pre-distorter need not be applied as shown in FIG. 1, but may be used in conjunction with any NLC to improve the distortion characteristics of the NLC output signal. In particular, the pre-distorter can operate at baseband, intermediate frequency (IF), or radio frequency (RF). The pre-distorter can be used not only in base station transceivers as shown in FIG. 1, but in mobile and other types of transmitters or receivers (e.g., to linearize the output signal of a low-noise amplifier (LNA) or mixer in the receive chain). Illustrative embodiments of such alternative applications will be described later with reference to FIGS. 16A and 16B.

In FIG. 1, a baseband combiner 110 can digitally combine the signals from a series of digital modems 112. The combiner 110 can output an in-phase signal (I) 110 a and a quadrature-phase signal (Q) 110 b which can be converted into analog signals by the DACs 113.1 and 113.2. The analog I and Q output signals 113.1 a and 113.2 a can then be input to an RF transceiver 111 which modulates the I and Q signals onto an RF carrier frequency f_(c), by multiplying the I and Q signals with a carrier signal generated by a VCO 120. The output signal 111 a of the RF transceiver 111 can be further processed by the pre-processor block 114, which may perform such operations as filtering and pre-amplification of the signal 111 a.

The output signal 114 a of the pre-processor block 114 can be input to a power coupler 115, which splits an input signal into multiple output signals. In one embodiment, the power coupler 115 splits the output signal 114 a of the pre-processor block 114 into two output signals 115 a and 115 b, as shown in FIG. 1. In the preferred embodiment, the signal 114 a may be of power 3 dBm, and signals 115 a and 115 b can be 0 dBm each. The output signal 115 a may be input directly to the Radio Frequency Power Amplifier Linearizer (RFPAL) block 101, and may serve as the datapath signal to be pre-distorted according to the algorithms described herein. The other output signal 115 b may be input to a coarse delay block 116, which can delay a signal 115 b by a pre-determined time period, and then be input to the RFPAL 101 as the delayed signal 116 a.
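
The power levels quoted for the two-way split can be checked with a short Python sketch. It assumes an ideal, lossless, equal-power split, which is a simplification; the helper names are hypothetical.

```python
import math

def dbm_to_mw(dbm):
    """Convert a power level in dBm to milliwatts."""
    return 10 ** (dbm / 10.0)

def mw_to_dbm(mw):
    """Convert a power level in milliwatts to dBm."""
    return 10.0 * math.log10(mw)

def split_equally(input_dbm, n_outputs):
    """Per-output power of an ideal lossless n-way equal split, in dBm."""
    return mw_to_dbm(dbm_to_mw(input_dbm) / n_outputs)

# A 3 dBm input split two ways yields about 0 dBm on each output.
per_output_dbm = split_equally(3.0, 2)
```

Under these assumptions each output sits within a few hundredths of a dB of 0 dBm, consistent with the figures given for signals 115 a and 115 b.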

The delay of the coarse delay block 116 may be chosen to approximate the delay of the Power Amplifier 107. In one embodiment, the Power Amplifier 107 is a 6S21140 LDMOS RF power field effect transistor (FET), available from Freescale Semiconductor, and the coarse delay block 116 delays the signal 115 b by about 5.9 ns. One of ordinary skill in the art will realize that the coarse delay block 116 may be a stand-alone component delay block, or an incorporated component delay block of the RFPAL integrated circuit (IC). Note that in one embodiment, a delay-locked loop (DLL) may also be incorporated in the RFPAL 101 to further adjust the relative delay between the power amplifier output signal 107 a and the reference signal 116 a. One of ordinary skill in the art will also recognize that the coarse delay block 116 may even be omitted if any resulting degradation in performance is deemed tolerable, e.g., if the delay of the PA 107 is negligible.

In an alternative embodiment of the pre-distorter, the power coupler 115 can split the output signal 114 a of the pre-processor block 114 into four output signals 115 a, 115 b, 115 c, and 115 d, as shown in FIG. 2. In this embodiment, output signals 115 c and 115 d may be input to memory delay blocks 116.1 and 116.2, respectively, and then input to the RFPAL 101 as signals 116.1 a and 116.2 a. Memory delay block signals 116.1 a and 116.2 a may be used in a memory compensator 304 in the RFPAL 101, to be described with reference to FIG. 3. The memory compensator 304 can generate pre-distorted versions of the signals 116.1 a and 116.2 a to correct distortion due to memory effects exhibited by the PA 107 by adding the pre-distorted versions of the signals to the distorted output signals (a memory compensator is also called a memory compensation summer). For this reason, the memory delay blocks 116.1 and 116.2 may be designed to introduce delays that approximate the PA memory delays. The internal architecture of the memory compensator 304 will be described later in the specification.

Referring back to FIG. 1, the RFPAL 101 may internally compare the delayed signal 116 a to an attenuated version 105 a of the RF power amplifier output signal 107 a to generate an error signal for driving the adaptive pre-distortion algorithms of the RFPAL 101. The RFPAL 101 may output a pre-distorted signal 101 a, which can be input to a pre-amplifier 106, and then to the power amplifier 107. The power amplifier output signal 107 a can be input to a coupler 104, which splits the signal 107 a into two signals 104 a and 104 b. The signal 104 a can then be input to the duplexer 103, and be transmitted over the radio channel using the antenna 102. The signal 104 b can be input to an attenuator 105 and fed back to the RFPAL 101 as signal 105 a, as earlier described.

FIG. 3 shows an overview of the internal system architecture of an embodiment of the RFPAL 101 shown in FIG. 1. One of ordinary skill in the art will recognize that the labeled blocks show only conceptualized divisions of the sub-functions of the RFPAL. Alternative logical and physical divisions of the sub-functions of the RFPAL also fall within the scope of the pre-distortion apparatus. For example, the pre-distortion block 302 and error signal generator block 303 may be implemented as one composite physical block.

The RFPAL 101 from FIG. 1 is similarly labeled as 101 in FIG. 3. Signal 115 a can serve as the datapath signal to be pre-distorted by the pre-distortion block 302. Signal 116 a, a delayed version of signal 115 b, is input to the error signal generator block 303. The signal 116 a can be referred to as the reference signal. The error signal generator block 303 can also receive as input an attenuated version 105 a of the power amplifier output signal 107 a. The signal 105 a can be referred to as the feedback signal. The error signal generator block 303 compares reference signal 116 a to feedback signal 105 a to generate an error signal e(t) 303 a, which is used to drive the adaptive pre-distortion algorithm in the pre-distortion block 302. The output signal 101 a of the pre-distortion block 302 can be input to the PA 107. The output signal 101 a can be referred to as the (buffered) pre-distorted signal.

In one embodiment, signals 116.1 a and 116.2 a can be input to a memory compensation block 304.

The RFPAL 101 may also comprise a microprocessor 305, which executes code stored in an electrically erasable programmable read-only memory (EEPROM) 306. The microprocessor functions may comprise, for example, accepting a signal 302 b from the pre-distortion block 302 indicative of the datapath signal 115 a's signal strength, and outputting signals 305 a and 305 b to adjust the gate bias 308 and drain bias 309, respectively, of the power amplifier 107.

The RFPAL 101 may also comprise a bandgap voltage reference 307 to provide a reference voltage for the on-chip circuitry.

FIG. 4 shows the portion of the RFPAL 101 corresponding to the pre-distortion block 302 and error signal generator block 303 shown in FIG. 3. A functional description of the blocks shown in FIG. 4 is now given, with an architectural description of the blocks to be given later in the specification. In the embodiment shown in FIG. 4, the datapath, reference, feedback, and pre-distorted signals are all real signals, i.e., signals having real amplitudes. One of ordinary skill in the art will recognize that the pre-distorter can also be described and implemented using complex signals, i.e., signals having both real and imaginary components.

As shown in FIG. 4, signal 115 a from the power coupler 115 in FIG. 3 is input to an RF buffer 411, which outputs a buffered signal 411 a. Signal 411 a is then input to a 0/90-degree quad phase generator 401. The phase generator 401 outputs a 0-degree phase-shifted (in-phase, or “I”) version of signal 411 a as signal 401 a, and a 90-degree phase-shifted (quadrature-phase, or “Q”) version of signal 411 a as signal 401 b. Note that hereinafter, with respect to FIG. 4, components specific to the in-phase (I) processing path will be denoted by a .1 appended to the block number, and components specific to the quadrature-phase (Q) processing path will be denoted by a .2 appended to the block number. For example, 402.1 denotes the polynomial function synthesizer for the I path, while 402.2 denotes the polynomial function synthesizer for the Q path. As the processing of the in-phase signal can be identical to the processing of the quadrature-phase signal, and the components used for the I path can be identical to those used for the Q path, only the processing of the I signal will be described herein for simplicity.

The buffered signal 411 a is also input to an envelope detector 408, which removes the RF component of the signal as well as the sign of the amplitude, and thus outputs a datapath envelope signal 408 a that tracks the envelope of the buffered datapath signal 411 a. The envelope signal 408 a is input to the P polynomial function synthesizer block 402.1. From the datapath envelope signal 408 a, the P polynomial function synthesizer block 402.1 can generate a set of synthesis work functions. These work functions may be weighted by the coefficients 403.1 c supplied by the Adapt P block 403.1. The weighted work functions may be summed to give a synthesized function 402.1 a. Block 402.1 may also be referred to as a synthesizing function generator.

The synthesized function 402.1 a is used by the RF variable-gain amplifier (VGA) 405.1 to modulate the gain of the I signal 401 a, resulting in the pre-distorted I signal 405.1 a. The RF VGA 405.1 thus effectively multiplies the synthesized function 402.1 a with the I signal 401 a.

Signal 405.1 a can then be summed with signal 405.2 a, generated by a corresponding set of Q-phase components (i.e., 403.2, 402.2, and 405.2), by the RF summer 407. The RF summer output signal 407 a, which is referred to as the unbuffered pre-distorted signal, can be buffered by RF buffer 409 to produce a buffered pre-distorted signal 101 a. In one embodiment of the RFPAL, the buffered signal 101 a may be directly output to the off-chip power amplifier 107. In an alternative embodiment, the output signal 101 a may first be input to an automatic gain control (AGC) circuit (not shown), whose gain may depend on the detected envelope of the power amplifier output signal 107 a. The AGC output signal may then be supplied to the PA 107. This feature can be used to correct for any variations in the gain of the PA 107 that might be caused by, for example, variations in the supply or bias voltages of the PA 107.

As noted earlier, the Adapt P block 403.1 supplies the set of adaptive coefficients 403.1 c to the P polynomial function synthesizer block 402.1. The adaptive coefficients 403.1 c may be computed according to an adaptive algorithm designed to minimize the error difference 303 a, or e(t), between signal 116 a and a scaled, buffered version 415 a of the PA output signal 107 a. In particular, the adaptive coefficients 403.1 c may comprise an optimal set of weights for weighting a chosen set of work functions. Embodiments of the adaptive algorithm, as well as preferred choices of basis functions, will be described in detail later in this specification.

To drive the adaptive algorithm, the Adapt P block 403.1 may accept as input signals the reference envelope signal 413 a of the buffered reference signal 412 a, the in-phase component 414 a of the buffered reference signal 412 a, and the error signal 303 a or e(t) generated by the error signal generator block 303. The Adapt P block 403.1 may also accept configuration parameters 403.1 b, such as the weights used to construct the basis functions from a set of monomial functions, from the Microprocessor 305 shown in FIG. 3. The Adapt P block 403.1 may provide a signal 403.1 a, which may include the adaptive coefficients p_(i) and q_(i) (later discussed with reference to the Adapt P block and Adapt Q block in FIG. 10), to the Microprocessor 305.

In a preferred embodiment, the Adapt P block 403.1 may be configurable such that the correlation coefficients are “frozen,” i.e., not updated, in response to an indication that the power of the pre-distorted signal exceeds a pre-determined threshold. In one implementation, this can be done by selectively setting μ=0 during those times when said indication is present. Unfreezing can then be achieved by setting μ to the value it had prior to its being set to 0. In an alternative embodiment, the signal 1003 a can be saturated if it exceeds a certain threshold value.
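
The freezing behavior can be sketched in Python as follows. The sketch assumes an LMS-style update in which μ acts as a step size and each coefficient moves by μ times the product of the error and the corresponding work function value; that update rule, its sign convention, and all names below are assumptions for illustration, since the passage above only states that setting μ=0 stops the update.

```python
def update_coefficients(coeffs, error, basis, mu, power, threshold):
    """One assumed LMS-style step with power-based freezing.

    coeffs    : current correlation coefficients
    error     : current error sample e(t)
    basis     : current values of the analysis work functions
    mu        : step size used when the update is not frozen
    power     : measured power of the pre-distorted signal
    threshold : pre-determined power threshold
    """
    # Freezing is modeled by using a step size of zero for this update;
    # the caller keeps the original mu, so unfreezing restores it.
    step = 0.0 if power > threshold else mu
    return [c + step * error * phi for c, phi in zip(coeffs, basis)]

frozen = update_coefficients([1.0, 2.0], 0.5, [1.0, -1.0], 0.1,
                             power=2.0, threshold=1.0)
active = update_coefficients([1.0, 2.0], 0.5, [1.0, -1.0], 0.1,
                             power=0.5, threshold=1.0)
```

Because the stored value of μ is never overwritten in this sketch, unfreezing requires no extra bookkeeping: the next call below the threshold simply uses μ again.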

Note the components labeled “RF” in FIG. 4, and described as “RF” in this specification, refer to RF signals in an embodiment wherein the pre-distorter is applied to an RF transmitter. In a preferred embodiment, the pre-distorter can be used to linearize RF signals by performing operations entirely at RF, thus providing a modular “drop-in” solution for non-linear RF components such as power amplifiers. However, one of ordinary skill in the art will recognize that the pre-distorter need not operate at RF. Rather, it can operate at any frequency, including IF or baseband, depending on the application. Such embodiments also fall within the scope of the pre-distortion apparatus.

Note also that the processing circuitry shown in FIG. 4 is split into a set of I components (denoted by suffix .1) and a set of Q components (denoted by suffix .2) for processing the I and Q signals, respectively, generated by quad phase generator 401. However, one of ordinary skill in the art will recognize that the same functionality described can be achieved using a single composite set of components for processing complex signals.

For example, it can be seen that the operations performed by the two VGAs 405.1 and 405.2 and the RF summer 407 essentially comprise two multiplications and one addition: one multiplication between the I signal 401 a and the synthesized I function 402.1 a, one multiplication between the Q signal 401 b and the synthesized Q function 402.1 b, and one addition between those two products. These operations can alternatively be described as taking the real part of the product of a complex multiplication, wherein the first complex multiplicand comprises a real part 401 a and an imaginary part 401 b, and the complex conjugate of the second complex multiplicand comprises a real part 402.1 a and an imaginary part 402.1 b. The real part of the product of such a complex multiplication will correspond to the signal 407 a. Thus, the pre-distorter can be implemented and/or described using either real or complex functions and components, and both implementations fall within the scope of the disclosed pre-distortion apparatus.
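
The equivalence between the real and complex descriptions can be checked numerically with a small Python sketch. The function and variable names are hypothetical; the sample values are arbitrary.

```python
def real_path(i_sig, q_sig, f_i, f_q):
    """Two multiplications and one addition, as done by the VGAs and the summer."""
    return i_sig * f_i + q_sig * f_q

def complex_path(i_sig, q_sig, f_i, f_q):
    """Real part of a complex product.

    The first multiplicand is i_sig + j*q_sig. The second multiplicand is
    the one whose complex conjugate is f_i + j*f_q, i.e. f_i - j*f_q.
    """
    first = complex(i_sig, q_sig)
    second = complex(f_i, f_q).conjugate()
    return (first * second).real

a = real_path(0.3, 0.4, 1.1, -0.2)
b = complex_path(0.3, 0.4, 1.1, -0.2)
```

Both paths give the same value because (I + jQ)(a - jb) has real part Ia + Qb.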

The details of the individual blocks of the RFPAL shown in FIG. 4 will now be described.

FIG. 7 shows an implementation of the Q polynomial function synthesizer 402.2 in FIG. 4. The same implementation can be used in the P polynomial function synthesizer 402.1 in FIG. 4. The Q polynomial function synthesizer 402.2 can accept as one input signal the envelope signal 701 (which can correspond to signal 408 a in FIG. 4), also denoted r in FIG. 7. The synthesizer 402.2 can also accept as input the coefficients 403.2 c comprising signals b₁, b₂, b₃, and b₄, which are supplied by the Adapt Q block 403.2 in FIG. 4. The signals b₁, b₂, b₃, and b₄ represent the set of adaptive coefficients computed by the Adapt Q block 403.2. According to the operations shown in FIG. 7, the output signal 703 can be expressed as b₄r³+b₃r²+b₂r+b₁. This output signal 703 can be referred to as the weighted sum of the synthesis work functions.

One of ordinary skill in the art will recognize that alternative architectures to the one shown in FIG. 7 may be used to generate basis polynomials from a set of monomials, including architectures employing Horner's method. Such alternative architectures are also within the scope of the pre-distortion apparatus.
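
For illustration, the following Python sketch evaluates the cubic b₄r³+b₃r²+b₂r+b₁ both directly and with Horner's method. It is a numerical model only, not a description of the circuit in FIG. 7, and the function names are hypothetical.

```python
def synth_direct(r, b):
    """Direct evaluation; b = [b1, b2, b3, b4]."""
    b1, b2, b3, b4 = b
    return b4 * r ** 3 + b3 * r ** 2 + b2 * r + b1

def synth_horner(r, b):
    """Horner's method: ((b4*r + b3)*r + b2)*r + b1, for any number of terms."""
    acc = 0.0
    for coeff in reversed(b):
        acc = acc * r + coeff
    return acc

direct = synth_direct(2.0, [1.0, 2.0, 3.0, 4.0])
horner = synth_horner(2.0, [1.0, 2.0, 3.0, 4.0])
```

Horner's form needs only three multiplications for a cubic, which is one reason it may be attractive when multipliers are costly.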

FIG. 10 shows an implementation of the Adapt P block 403.1. The Adapt Q block 403.2 shown in FIG. 4 may be implemented in a similar manner. In one embodiment, the Adapt P and Adapt Q blocks may be implemented as one logical block with two instances of the circuitry shown in FIG. 10.

As described earlier, the Adapt P block 403.1 can accept as inputs an RF signal e(t) 1001, which can correspond to the error signal 303 a generated by the error signal generator 303 in FIG. 4, and an RF signal P1 1002, which can correspond to the I component 414 a of the reference signal 116 a shown in FIG. 4. Furthermore, the Adapt P block 403.1 can accept as input a baseband signal r1 1007, which can correspond to the reference envelope signal 413 a generated by the envelope detector 413 shown in FIG. 4. The Adapt P block 403.1 can also accept as parameter inputs a set of coefficients w, collectively labeled 1009, corresponding to the coefficients used to construct the work functions for the adaptive algorithm. These coefficients 1009 may be supplied by a microprocessor 1010, shown in FIG. 10A. After performing the adaptive algorithm, the Adapt P block 403.1 can output a set of coefficients p₁, . . . , p_(i), . . . , p_(N), labeled in FIG. 10 and collectively denoted 1011 in FIG. 10A. These coefficients can be converted to digital form by the ADCs 1020.i, and then be input to the microprocessor 1010. The microprocessor 1010 can convert the coefficients 1011 to a set of monomial function coefficients 1012, which can then be input to the P polynomial function synthesizer 402.1 as coefficients 403.1 c shown in FIG. 4. Digital-to-analog converters (DACs) 1030.i may be used to convert the digital signals from the microprocessor 1010 to analog signals.
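
One way to picture the conversion from the coefficients 1011 to monomial coefficients is the following Python sketch. It assumes that each work function is a polynomial in the envelope with weights w[i][j] on the j-th power, and that the synthesized function is the sum of the work functions weighted by p[i]; under those assumptions the monomial coefficient of the j-th power is the sum over i of p[i]·w[i][j]. The function names and zero-based indexing are hypothetical.

```python
def to_monomial_coeffs(p, w):
    """Return b with b[j] = sum_i p[i] * w[i][j] (a linear transformation)."""
    n_powers = len(w[0])
    return [sum(p_i * row[j] for p_i, row in zip(p, w)) for j in range(n_powers)]

def eval_work_function_sum(r, p, w):
    """sum_i p[i] * (sum_j w[i][j] * r**j), evaluated directly."""
    return sum(p_i * sum(w_ij * r ** j for j, w_ij in enumerate(row))
               for p_i, row in zip(p, w))

def eval_monomials(r, b):
    """sum_j b[j] * r**j."""
    return sum(b_j * r ** j for j, b_j in enumerate(b))

w = [[1.0, 0.0], [-1.0, 2.0]]      # two illustrative work functions: 1 and 2r - 1
p = [3.0, 0.5]
b = to_monomial_coeffs(p, w)
```

Both evaluation routes give the same synthesized function, so the synthesizer only ever needs the monomial coefficients.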

The architecture of the work function generator 1006 will now be described. The work function generator 1006 synthesizes a set of N analysis work functions 1006.1, . . . , 1006.i, . . . , 1006.N. Here, the variable i is an index (from 1 to N) to an arbitrary one of the N work functions. In the embodiment shown in FIG. 11, N=4, and the work function generator 1006 receives the reference envelope signal r₁ 1007 and generates raised powers r₁ ², . . . , r₁ ^(N-1) of signal 1007 using multipliers 1101 and 1102 successively. In this specification and in the claims, a “raised power” of an envelope signal refers to a signal whose amplitude corresponds to the envelope signal's amplitude raised to an exponential power. For example, “the N raised powers of the reference envelope signal r₁” may refer to the signals r₁ ⁰ (or 1), r₁ ¹ (or r₁), r₁ ², . . . , r₁ ^(N-1) with r₁ ⁰ corresponding to a DC term, and r₁ ¹ corresponding to the original envelope signal r₁ 1007.

As shown in FIG. 11, the work function generator can weight (multiply) each raised power of the reference envelope signal by a coefficient w_(ij) (where j indexes the raised power of the envelope signal, and ranges from 0 to N−1) and the weighted raised powers may be summed over j to produce a plurality of polynomial work functions 1006.i. Each work function 1006.i is thus seen to be a linear combination of raised powers of the reference envelope signal r₁ 1007.
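By way of illustration only, the weighting and summation of FIG. 11 can be sketched numerically as follows. The sketch assumes a sampled envelope signal; the function name `work_functions` and the example weights are hypothetical and are not taken from the specification.

```python
import numpy as np

def work_functions(r1, w):
    """Return an array whose row i is the work function sum_j w[i, j] * r1**j.

    r1 : 1-D array of sampled reference-envelope values (sampling is an assumption).
    w  : 2-D array of weights, shape (N, number_of_raised_powers).
    """
    r1 = np.asarray(r1, dtype=float)
    w = np.asarray(w, dtype=float)
    # Build the raised powers 1, r1, r1^2, ... by successive multiplication,
    # mirroring the chained multipliers of FIG. 11.
    powers = [np.ones_like(r1)]
    for _ in range(w.shape[1] - 1):
        powers.append(powers[-1] * r1)
    powers = np.stack(powers)      # shape (num_powers, num_samples)
    return w @ powers              # shape (N, num_samples)

# Identity weights reproduce the monomials 1, r1, r1^2, r1^3.
r = np.array([0.0, 0.5, 1.0, 2.0])
phi = work_functions(r, np.eye(4))
```

With identity weights, each work function reduces to a single monomial; any other weight matrix yields linear combinations of the same raised powers.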

In the embodiment shown in FIG. 11, there are N work functions generated from four raised powers of the reference envelope signal. One of ordinary skill in the art will recognize that the pre-distorter is not limited to only four raised powers of the envelope signal. The pre-distorter encompasses any number of raised powers of the envelope signal. Furthermore, the pre-distorter is not limited to only four work functions generated from four raised powers—the number of work functions N may be more than the number of raised powers, allowing for a set of dependent, rather than independent, vectors.

In a preferred embodiment, four work functions (i.e., N=4) are generated from four raised powers of the envelope signal, and each work function consists of one of the four monomials 1, r₁, r₁ ², r₁ ³. In another preferred embodiment, four work functions Φ_(i) are generated from four raised powers of the reference envelope signals, each polynomial Φ_(i) comprising a weighted sum (i.e., a linear combination) of the monomials 1, r₁, r₁ ², . . . , r₁ ^(N-1). The RMS value of each work function can be set to 1 Volt in a preferred embodiment. A preferred set of weights w for each polynomial Φ_(i), chosen for the case where the power level of the signal input to the RFPAL is 0 dBm, is shown in FIG. 12, with the weights defined according to the work function generator shown in FIG. 11.

In general, the work functions may be chosen to be orthogonal to each other, and thus may be constructed according to procedures known to those of ordinary skill in the art, such as Gram-Schmidt orthogonalization or the Cholesky method.

In a preferred embodiment, the work functions may be chosen as follows to help speed up convergence of the adaptive algorithm. In particular, define a column vector [1, r₁, r₁ ², r₁ ³]^(T) as a monomial basis function vector. Define the expectation of the outer product of this vector (i.e., E{[1, r₁, r₁ ², r₁ ³]^(T)·[1, r₁, r₁ ², r₁ ³]}) as the autocorrelation matrix. The work functions may be chosen to reduce the eigenvalue spread of this autocorrelation matrix. In practice, the autocorrelation matrix can be approximated by taking the long-term averages of the outer product of the monomial basis function vector. Note that according to this embodiment, the coefficients for both the analysis and synthesis work functions may be derived once and stored in memory for later use, or they may be continuously updated, e.g., every 100 ms, to account for variations in the power level of the input to the RFPAL.
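One possible way to choose such weights is sketched below, using the Cholesky factorization mentioned above. It is an illustrative sketch only: the envelope statistics (a Rayleigh distribution), the sample count, and the function name `whitening_weights` are assumptions. The resulting work functions have an approximately identity autocorrelation matrix, so the eigenvalue spread approaches 1 and each work function has unit RMS value.

```python
import numpy as np

def whitening_weights(r1, order=4):
    """Weights w (rows = work functions) whose outputs have identity autocorrelation."""
    r1 = np.asarray(r1, dtype=float)
    # Monomial basis vector [1, r1, r1^2, ...] for every sample; shape (order, samples).
    m = np.vstack([r1 ** j for j in range(order)])
    # The long-term average of the outer product approximates E{m m^T}.
    R = (m @ m.T) / m.shape[1]
    # With R = L L^T (Cholesky), choosing w = L^{-1} gives E{(w m)(w m)^T} = I.
    L = np.linalg.cholesky(R)
    w = np.linalg.inv(L)
    return w, R

rng = np.random.default_rng(0)
envelope = rng.rayleigh(scale=0.5, size=50_000)    # hypothetical envelope statistics
w, R = whitening_weights(envelope)
m = np.vstack([envelope ** j for j in range(4)])
R_work = (w @ m) @ (w @ m).T / envelope.size        # autocorrelation of the work functions
spread_before = np.linalg.cond(R)                   # eigenvalue spread of the monomials
spread_after = np.linalg.cond(R_work)               # eigenvalue spread after re-weighting
```

Because the weights depend on the envelope statistics, they would need to be recomputed whenever the input power level changes appreciably.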

To decrease the number of adders and multipliers needed to implement the work function generator 1006, the alternative architecture shown in FIG. 11A may be employed. This architecture generates four functions r₁ ³+w₄r₁ ²+w₅r₁ ¹+w₆, r₁ ²+w₂r₁+w₃, r₁ ¹+w₁, and 1 as signals 1006.4, 1006.3, 1006.2, and 1006.1. Since these functions are generally not normalized with respect to each other, an additional set of gains m 1401 could be applied to normalize the coefficients p 1403 during post-processing by the microprocessor 1010, as shown in FIG. 14. Note however that according to the pre-distorter, the work functions need not be normalized, and may have unequal powers depending on the choice of gains m 1401 shown in FIG. 14.

Referring back to FIG. 10, each work function 1006.i is separately multiplied with the signal 1004 a using a multiplier 1005.i to generate an output signal 1005.ia. The signal 1004 a comprises the error signal e(t) 1001 multiplied by the in-phase component P1 1002 of the reference signal, and then low-pass filtered by LPF1 1004. The LPF1 1004 contributes a gain G₁. Each output signal 1005.ia is then passed through a corresponding low-pass filter (LPF2) 1007.i, generating an output signal 1007.ia. The LPF2 1007.i contributes a gain G₂. An amplifier 1008.i, which contributes a gain of G₃, amplifies each output signal 1007.ia to generate a coefficient p_(i). In an embodiment, the bandwidths of both LPF1 and LPF2 can be 400 MHz. One of ordinary skill in the art will recognize that the gain μ of the adaptive algorithm can generally be expressed as: μ = T·G₁·G₂·G₃

In the preferred embodiment, μ is chosen as a value between 1.25×10⁻⁶ and 2.5×10⁻⁶, in order to yield good convergence speed and offset-insensitivity. If T is chosen to be in the range 30-50, as previously described, then the remaining gain μ/T can be distributed evenly among the terms G₁, G₂, and G₃. Alternatively, the low-pass filter gains G₁ and G₂ can be set equal to each other, and the amplifier gain G₃ can provide the necessary residual gain.
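The two gain allocations described above can be illustrated with the following sketch. The function names and the numerical values of μ, T, and the filter gain are hypothetical, chosen only to fall within the stated ranges.

```python
def residual_gain(mu, T):
    """Product G1*G2*G3 required so that mu = T * G1 * G2 * G3."""
    return mu / T

def even_split(mu, T):
    """Distribute the residual gain evenly: G1 = G2 = G3."""
    g = residual_gain(mu, T) ** (1.0 / 3.0)
    return g, g, g

def equal_filter_gains(mu, T, g_filter):
    """Set G1 = G2 = g_filter and let the amplifier gain G3 supply the remainder."""
    return g_filter, g_filter, residual_gain(mu, T) / g_filter ** 2

# Example values (assumed): mu = 2.0e-6 and T = 40.
g1, g2, g3 = even_split(mu=2.0e-6, T=40.0)
f1, f2, f3 = equal_filter_gains(mu=2.0e-6, T=40.0, g_filter=0.01)
```

Either allocation yields the same overall gain μ; the choice affects only how signal levels are distributed along the chain.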

As the operations shown in FIG. 10 are all linear, each coefficient p_(i) effectively comprises the result of correlating the signal e(t) 1001 with an analysis basis function defined as: r₁·Φ_(i) cos [ω_(c)(t−d)+p(t−d)],

where d represents the delay introduced by the coarse delay block 116 in FIG. 1. In this specification and in the claims, the operations of multiplying two signals, then low-pass filtering the product, may collectively be referred to as “correlating” the two signals. In general, the basis functions may be chosen to approximately span the inverse of the function space to which an NLC maps an input signal. The basis functions in turn dictate the choice of coefficients w 1009 for the work functions 1006.i. In this specification and in the claims, a “basis function” is equivalent to a work function (which is generally a polynomial function of an envelope signal) multiplied by a signal carrying the original phase and amplitude. Thus, the orders of the monomials in a work function polynomial are generally one less than the orders of the monomials in a corresponding basis function polynomial.
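The “correlating” operation defined above (multiplication followed by low-pass filtering) can be sketched in discrete time as follows. This is a simplified illustration: the low-pass filter is idealized as an average over the whole record, and the test tones are hypothetical.

```python
import numpy as np

def correlate(x, y):
    """Multiply two sampled signals, then low-pass filter the product.

    The low-pass filter is idealized here as an average over the whole record
    (an assumption; the specification uses the analog filters LPF1 and LPF2).
    """
    return float(np.mean(np.asarray(x) * np.asarray(y)))

# Hypothetical test tones: 1000 samples spanning exactly 10 carrier cycles.
t = np.arange(1000) / 1000.0
carrier = np.cos(2 * np.pi * 10 * t)
same_phase = correlate(carrier, carrier)                      # approximately 0.5
quadrature = correlate(carrier, np.sin(2 * np.pi * 10 * t))   # approximately 0
```

A signal thus correlates strongly with a component sharing its phase and negligibly with a component in quadrature, which is why separate P and Q adaptation paths can be used.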

Architectures for LPF's 1004 and 1007.i are well-known to those of ordinary skill in the art. A preferred embodiment of an LPF is shown in FIG. 13.

In an embodiment of the pre-distorter wherein the error signal generator 303 generates an error signal e(t) 303 a equal to tanh [T·diff], each coefficient p_(i) output by an LPF 1007.i can be ideally expressed as:

$p_{i}(t_{0}) = \mu\int_{0}^{t_{0}}\left\{\left(\Phi_{i}\cos\Theta\right)\tanh\left[T \cdot \mathrm{diff}\right]\right\}dt$

where t₀ is a time index, Φ_(i) is the generalized polynomial work function 1006.i, and Θ is the phase component (including the carrier) of the signal 411 a in FIG. 4. Similarly, in an embodiment of the Adapt Q block 403.2, each coefficient q_(i) can be expressed as:

$q_{i}(t_{0}) = \mu\int_{0}^{t_{0}}\left\{\left(\Phi_{i}\cos\Theta\right)\tanh\left[T \cdot \mathrm{diff}\right]\right\}dt$

Note that since the coefficients p₁, . . . , p_(N) are the correlated output signals of each analysis basis function, which can in general be polynomial functions of the reference envelope signal, a further linear transformation needs to be performed to derive a set of coefficients a₁, . . . , a_(N) which can be directly multiplied with the monomials r⁰, r¹, r², and r³ (where r corresponds to the datapath envelope signal) generated in the P and Q polynomial function synthesizers 402.1 and 402.2 shown in FIG. 4. This linear transformation can be performed by the microprocessor 1010 shown in FIG. 10A according to the operations in FIG. 14.

In a preferred embodiment, the synthesis work functions are constructed from the same weights as used to construct the analysis work functions in the work function generator 1006 of FIG. 10. One of ordinary skill in the art will recognize that in general the analysis work functions need not be identical to the synthesis work functions, and may be different if desired, e.g., to correct for any systematic bias in the system. In such an alternative embodiment, the linear transformations described below may be altered accordingly.

According to this preferred embodiment, FIG. 14 shows a matrix 1402 wherein each row corresponds to the monomial weights of a single analysis function 1006.i as defined in FIG. 11. This assumes that the synthesis work functions are identical to the analysis work functions. One of ordinary skill in the art will recognize that the pre-distorter also encompasses embodiments wherein the synthesis work functions are different from the analysis work functions. Multiplying the diagonal matrix 1401 with matrix 1402 effectively applies a gain m_(i) to each row of 1402. The product is then multiplied by the vector 1403, which weights each coefficient of each basis function (times m_(i)) with a coefficient p_(i) derived from the adaptive algorithm, and sums the weighted coefficients. In a preferred embodiment, a vector of offsets n 1404 may be added to compensate for any offsets in the system. These offsets n 1404 may be all zero in the preferred embodiment. The resulting vector 1405 can be input to the P Polynomial function synthesizer block 402.1 as the coefficients 402.1 a. Similar operations can be performed for the Q coefficients q_(i). One of ordinary skill in the art will recognize that the linear transformation shown in FIG. 14 can be easily extended to systems using more than four basis functions. One of ordinary skill in the art will also recognize that the linear transformation can be performed not only by a microprocessor, but by a variety of other means including analog circuitry or amplifiers.

One of ordinary skill in the art will also recognize that various options may be selected simply by configuring the linear transformation shown in FIG. 14. For example, the adaptation may be disabled for a period of time, and a fixed set of coefficients may be supplied to the synthesis work function generators, by setting the gains m 1401 to all zero, and setting the vector n to be equal to the static coefficient values. Or, depending on appropriate selection metrics, some of the work functions may be selectively disabled by setting the corresponding gains m 1401 to zero.
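The linear transformation of FIG. 14, together with the configuration options just described, can be sketched as follows. The sketch assumes that each monomial coefficient is formed as a_j = Σ_i p_i·m_i·w_ij + n_j, consistent with the description above; the function name and all numerical values are hypothetical.

```python
import numpy as np

def monomial_coefficients(p, w, m=None, n=None):
    """Map adapted coefficients p_i to monomial coefficients a_j.

    a_j = sum_i p_i * m_i * w[i, j] + n_j
    Row i of w holds the monomial weights of work function i (matrix 1402),
    m holds the per-row gains (diagonal of 1401), n holds the offsets (1404).
    """
    p = np.asarray(p, dtype=float)
    w = np.asarray(w, dtype=float)
    m = np.ones(w.shape[0]) if m is None else np.asarray(m, dtype=float)
    n = np.zeros(w.shape[1]) if n is None else np.asarray(n, dtype=float)
    return (np.diag(m) @ w).T @ p + n

# Hypothetical weights: the second work function is (r - 0.5); the others are monomials.
w = np.array([[1.0, 0.0, 0.0, 0.0],
              [-0.5, 1.0, 0.0, 0.0],
              [0.0, 0.0, 1.0, 0.0],
              [0.0, 0.0, 0.0, 1.0]])
p = np.array([0.2, 0.4, 0.0, -0.1])
a = monomial_coefficients(p, w)

# Adaptation disabled: all gains zero, and the offsets carry the static coefficients.
static = np.array([1.0, 0.0, 0.0, 0.0])
a_frozen = monomial_coefficients(p, w, m=np.zeros(4), n=static)
```

Setting a single entry of m to zero, rather than all of them, would correspondingly disable only the associated work function.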

Note a preferred embodiment of the pre-distorter can utilize a memory compensation block 304 as shown in FIG. 3 to correct for distortion caused by memory effects exhibited by the PA 107. In particular, an NLC with memory effects generates an output signal NLC_(memory) that can be modeled as:

NLC_(memory)(s(t)) = NLC(s(t)) + NLC(s(t − t₁)) + … + NLC(s(t − t_(M))),

where NLC( ) represents the functional transformation performed on an NLC input by an NLC without memory effects, as described earlier in (Eq. 1), and t₁, . . . , t_(M) represent the delays introduced by an NLC with memory effects.
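The memory model above can be illustrated in discrete time as follows. The sketch assumes delays expressed as whole numbers of samples and zero signal history before the first sample; the cubic nonlinearity is a hypothetical stand-in for NLC( ) and is not taken from the specification.

```python
import numpy as np

def nlc_with_memory(s, nlc, delays):
    """Sum nlc(s(t)) and nlc(s(t - t_k)) for each delay t_k, given in samples."""
    s = np.asarray(s, dtype=float)
    out = nlc(s)
    for d in delays:
        # Shift the input by d samples, assuming zero history before t = 0.
        delayed = np.concatenate([np.zeros(d), s[:len(s) - d]])
        out = out + nlc(delayed)
    return out

# Hypothetical memoryless nonlinearity: mild cubic compression.
def cubic(x):
    return x - 0.1 * x ** 3

# An impulse input makes the individual delayed contributions visible.
y = nlc_with_memory([1.0, 0.0, 0.0, 0.0], cubic, delays=[1, 3])
```

Each delayed term appears as a separate echo of the memoryless response, which is the structure the memory compensator is intended to cancel.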

To correct for the distortion arising from an NLC with memory effects, FIG. 15 shows a memory compensator 304 which can utilize the adaptive algorithms described earlier to pre-distort signals 1501 and 1502, which can correspond to the delayed versions 116.1 a and 116.2 a, respectively, of the datapath signal 115 a shown in FIG. 2. The delays of signals 116.1 a and 116.2 a may be chosen to approximate the two most significant PA memory delays. One of ordinary skill in the art will recognize that the memory compensator is not limited to only two delayed signals, but in general can be applied to an arbitrary number of delayed signals by simply scaling the architecture described herein.

One of ordinary skill in the art will also recognize that each instance 1504 and 1505 of the adaptive linearizer has been simplified with respect to the implementation described in FIG. 4. In particular, both the analysis functions and the synthesis functions for 1504 are generated from the same envelope detector output 1504.3 a, which works well in general if the PA delay is small, as described earlier. The memory compensator nevertheless encompasses implementations where a coarse delay block such as 116 is used. Furthermore, various signals such as P1 and Q1 of FIG. 4 are not shown in FIG. 15 for simplicity of presentation. The memory compensator can in general use all of the features disclosed in this specification for the design of the constituent instances of the adaptive linearizer (shown as 1504 and 1505 in FIG. 15), and thus the scope of the memory compensator should not be construed as being limited to that shown in FIG. 15.

In FIG. 15, signal x 1501 may be the delayed signal 116.1 a in FIG. 2, and signal y 1502 may be the delayed signal 116.2 a. FIG. 15 shows that signals x 1501 and y 1502 can each be processed by an independent instance 1504 or 1505 of the same architecture used for the datapath signal 115 a in FIG. 4. Instances 1504 and 1505 can share the same error signal e(t) 303 a as generated by the error signal generator 303 in FIG. 4. In general, as long as each memory-delayed signal is sufficiently uncorrelated with other memory-delayed signals, then the adaptive algorithm of each instance of the pre-distortion architecture will act to minimize the distortion error of a single memory-delayed signal independently of other memory-delayed signals. Note therefore that poorer performance may result when the memory-delayed signals are highly correlated with each other, e.g., if the memory delays of the non-linear component are much less than the inverse of the signal bandwidth.

Note also that the analysis work functions generated internally by the Adapt P blocks 1504.1 and 1505.1 and Adapt Q blocks 1504.2 and 1505.2 should be generated from the envelope signals of the delayed input signals x 1501 and y 1502. The output signal 1504 a of the instance 1504 may be summed with the output signal 1505 a of the instance 1505 to arrive at an output signal z 1503. This signal z can be added to the main datapath signal 407 a by an RF summer (not shown) to generate a composite pre-distorted signal that corrects for the memory effects associated with two PA memory delays.

In a further embodiment of the memory compensator, the delays of the memory effects could also be accounted for using DLL tracking, in addition to being approximated by the delays associated with the coarse delay blocks 116.1 and 116.2. In such an embodiment, a DLL can be used to lock, e.g., the signal 1501 to the residual error of the main adaptation, i.e., the difference between (Σ_(i) p_(i)*analysis basis functions) and the error signal. This would be a decision feedback embodiment of the memory compensator, and allow the delay components of the memory compensator to better approximate the actual memory delays of the non-linear component.

One of ordinary skill in the art will recognize that the functions used to perform the correlation and the functions used to synthesize the pre-distorted (delayed) signal generally need NOT be the same functions. Rather, they may be delayed relative to each other by the PA delay, analogous to the case for the main datapath signal and the reference signal. Thus the coarse delay 116.1 may be split into two smaller delays, one of which is the PA delay currently used for 116, and one of which is the actual delay corresponding to a PA memory delay. In this case, then, the older signal may be used to perform the adaptation, while the newer signal may be used to perform the synthesis. One of ordinary skill in the art will recognize that if the PA delay (approximated by block 107 in FIG. 1) is significantly less than the PA memory delays (approximated by blocks 116.1 and 116.2), then satisfactory performance may be achieved even if the blocks 116.1 and 116.2 are not sub-divided into smaller delays. In fact, if the PA delay is negligible, the coarse delay block 116 may be omitted altogether without substantially compromising the performance of the adaptive algorithms.

FIGS. 16A and 16B show alternative embodiments of the predistortion apparatus. FIG. 16A depicts an embodiment of the predistortion apparatus 1605 (labeled “ARFL” for adaptive RF linearizer) used to linearize the output signal 1602 of the RF front end 1607 (labeled “RFFE”) in a receiver chain. As shown in the diagram, the ARFL 1605 inputs an RF signal 1602 (non-linearly distorted by the RFFE 1607), a reference signal 1604 corresponding to a delayed version of the input signal 1601 to the RFFE 1607, and outputs a corrected (ideally distortion-free) signal 1608.

FIG. 16B depicts an embodiment of the predistortion apparatus 1611 (labeled “GAL” for Gigabit Adaptive Linearizer) used to linearize the analog-to-digital mapping of the analog-to-digital converter (ADC) block 1615. The GAL 1611 receives as input a gigabit analog signal 1610, a reference signal 1614 corresponding to the analog output signal of the digital-to-analog converter (DAC) 1616, and outputs a pre-distorted signal 1612.

Circuit Implementations

Various possible circuit implementations of the blocks of the pre-distortion apparatus will now be described in detail. These descriptions are meant to be illustrative only, and are not meant to limit the scope of the pre-distortion apparatus to any particular circuit implementation herein disclosed.

Envelope Detector

FIG. 5 shows a preferred embodiment of the envelope detectors 408 and 413 shown in FIG. 4. The envelope detector takes an input signal 501, and outputs an envelope signal 510 that is a low-pass filtered version of the absolute value of the input signal 501. The bandwidth of the low-pass filter may be adjusted by adjusting the capacitance C1 of the capacitor 504. In a preferred embodiment, the capacitance C1 is chosen in conjunction with the output resistance of the current source I1 in FIG. 5 to provide for a bandwidth of about 20 MHz.

An alternative embodiment of the envelope detector known as an “orthogonal peak detector” is shown in FIG. 5A. In this embodiment, an in-phase component signal 513 and a quadrature-phase component signal 512 are generated from the input signal 511. Component signals 512 and 513 are squared using multipliers 512.1 and 513.1, respectively. The squared signals are summed using adder 515.1, to give a squared envelope signal 516, from which the square root generator 516.1 generates the envelope signal 517. Note that in a preferred embodiment, the quadrature generator 511.1 has nominally unity gain, and any actual difference from unity gain may be compensated in the non-quadrature path by applying a corresponding gain using an amplifier (not shown). Note the gain of such an amplifier may be compensated for elsewhere in the signal path, e.g., in the RF VGA.
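The signal flow of the orthogonal peak detector can be sketched numerically as follows. The sketch assumes an ideal quadrature generator (exactly unity gain and 90 degrees of phase shift) and a hypothetical amplitude-modulated test signal.

```python
import numpy as np

def orthogonal_peak_detector(i_component, q_component):
    """Envelope = sqrt(I^2 + Q^2), following the signal flow of FIG. 5A."""
    squared_envelope = np.square(i_component) + np.square(q_component)
    return np.sqrt(squared_envelope)

# Hypothetical test signal: slowly varying amplitude on a 50-cycle carrier.
t = np.linspace(0.0, 1.0, 2000, endpoint=False)
amplitude = 1.0 + 0.5 * np.cos(2 * np.pi * t)
phase = 2 * np.pi * 50 * t
envelope = orthogonal_peak_detector(amplitude * np.cos(phase),
                                    amplitude * np.sin(phase))
```

Under these idealized assumptions the detector recovers the amplitude exactly, without the ripple and filtering delay of a rectifier followed by a low-pass filter.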

FIG. 5B shows an embodiment of the square root generator 516.1 in FIG. 5A. In a preferred embodiment, the amplifier 522 is a voltage amplifier with high input impedance and low driving point output impedance. Furthermore, the amplifier gain need not be large unless the resistors 523 and 524 are not well-matched.

FIG. 5C shows an alternative embodiment of an envelope detector, known as a “diode peak detector.” This embodiment comprises a transconductor 531, a diode 532, a capacitor 533, and a voltage amplifier 534. The transconductor 531 accepts as input signals the envelope detector input signal 530 and the envelope detector output signal 539. When signal 539 is greater than signal 530, the transconductor 531 generates current in the direction of arrow 531.1, which forward biases the diode 532 to charge the capacitor 533. When signal 539 is less than 530, the transconductor 531 outputs current in the direction against the arrow 531.1, thus reverse-biasing the diode 532, and preventing any current from the transconductor 531 from discharging the capacitor 533. Thus, the combination of the diode 532 and capacitor 533 functions as a rectifier. As amplifier 534 is configured to be a unity gain buffer, signal 539 follows the voltage across the capacitor 533.

Ideally, no external resistance is required for the capacitor 533 to discharge, as the inherent terminating input resistances of the voltage amplifier 534 may be utilized. In a preferred embodiment, the input resistance of the voltage amplifier 534 can be relatively low at 100-200Ω, and the capacitance of capacitor C_(p) can be chosen to give an RC time constant on the order of 1/(2 πf) seconds, where f is the operating frequency in Hz. In a preferred embodiment, the operating frequency is a frequency less than 2.2 GHz.

Yet another embodiment of a peak detector is shown in FIG. 5D. In FIG. 5D, an input signal V_(in) is applied to the gate of transistor M1 configured as a source follower. During envelope detection, transistor M2 is turned off, and the voltage V_(out) across the capacitor C follows the envelope of the input signal V_(in). To reset the voltage V_(out), transistor M2 can be turned on.

One of ordinary skill in the art will recognize that various alternative implementations of envelope detectors known in the art may be substituted for the detectors shown in FIGS. 5-5D. The disclosed implementations are not meant to limit the scope of the pre-distortion apparatus.

Quadrature Generator

FIGS. 6A-6H show several possible embodiments of quadrature phase generators 401 and 414 shown in FIG. 4. FIG. 6A shows a standard RC-CR network well known in the prior art. (See, e.g., Behzad Razavi, RF Microelectronics, Prentice Hall PTR (1998), pp 138-139.)

FIG. 6B shows a phase-shifter implemented using a Hilbert transformer 690.

FIG. 6C shows a quadrature generator implemented using an active LC network circuit. The following equations show the relationships of the signals in FIG. 6C:

V₂ = −V₁, V₃ = −V₄, and

$V_{4} = \frac{G_{1}\left(V_{1} - V_{2}\right)}{2sC} = \frac{G_{1}V_{1}}{sC},$

where G₁ and G₂ represent the forward transconductances of the respective transconductors 602 and 603 shown in FIG. 6C, and s is the Laplace transform variable. It can be seen therefore that the differential signal V₁-V₂ will have a quadrature phase relationship with the differential signal V₃-V₄.

Referring to FIG. 6C, a differential-input-to-differential-output transconductor block 601 converts a single-ended input signal V_(s) to a signal current of g_(m)V_(s)/2 that flows into transconductor 601 at one of its output ports and out of transconductor 601 at its other output port. Voltages V₁ and V₂ are supplied to a differential gyrator 604, which generates output signals V₃ and V₄. The differential gyrator 604 comprises two transconductors 602 and 603.

FIG. 6G shows one embodiment of the active LC network circuit of FIG. 6C, wherein the input transconductance stage 601 is modeled as two transconductors 620 and 621 that each generate a signal current proportional to the input voltage V_(s). A resistance R₀ and a capacitance C₀ are also associated with each of the two transconductors 620 and 621.

The transfer functions of this circuit can be derived as:

$H_{1}(s) = \frac{V_{1}}{V_{s}} = -\frac{V_{2}}{V_{s}} = \frac{\left(\frac{H_{0}s}{Q\omega_{0}}\right)}{1 + \frac{s}{Q\omega_{0}} + \left(\frac{s}{\omega_{0}}\right)^{2}}$

$H_{2}(s) = \frac{V_{4}}{V_{s}} = -\frac{V_{3}}{V_{s}} = \frac{\left(\frac{H_{0}G_{1}}{Q\omega_{0}C}\right)}{1 + \frac{s}{Q\omega_{0}} + \left(\frac{s}{\omega_{0}}\right)^{2}},$

where:

$H_{0} = \frac{g_{m}R_{0}}{2}$

$\omega_{0} = \sqrt{\frac{2G_{1}G_{2}}{CC_{0}}}$

$Q = R_{0}\sqrt{\frac{C_{0}}{C}}\sqrt{2G_{1}G_{2}}$

In the above equations, the parameter H₀ corresponds to the center frequency gain of the H₁(jω) transfer function or the low-frequency gain of the H₂(jω) transfer function, ω₀ corresponds to the center frequency of the H₁(jω) transfer function or the 3-dB bandwidth of the H₂(jω) transfer function, and Q corresponds to the quality factor of the H₁(jω) transfer function or the H₂(jω) transfer function.

To allow the quadrature generator to operate over a broad range of frequencies, the parameters may be adjusted based on the particular frequency range. FIG. 6D shows a modified active LC circuit wherein the capacitance C is adjustable by configuring a set of switches connected to a series of capacitors 631. The capacitance C may be implemented as a bank of switchable shunt capacitors, up to five capacitors in an embodiment, to afford amplitude equalization of the in-phase and quadrature components throughout the passband of interest. The banks allow dynamic setting of the parameter C, which controls the parameters Q and ω₀ per the equations given above.

FIG. 6D also shows the technique of employing a bank of capacitors to allow selective switching of the capacitance C₀. One of ordinary skill in the art will note that as the capacitors shown in FIG. 6D are connected in shunt across their respective nodes V1-V2 and V3-V4, whereas the capacitors shown in FIG. 6C are shunted to ground, appropriate scaling in values should be made.

Note that proper design also requires accounting for the parasitic capacitances (labeled “Parasitic” in FIG. 6D) present at the nodes corresponding to voltages V1, V2, V3, and V4.

For fine-tuning the capacitance C or C₀, one or more of the capacitors in each bank may be continuously adjustable via voltage control. This may be accomplished by implementing these capacitors as varactors or MOSCAPs.

In a preferred embodiment, the parameters g_(m), C, C₀, R₀, G₁ and G₂ are chosen as follows:

$G_{2}R_{0} = \frac{1}{2}$

$\frac{G_{1}}{G_{2}} = \frac{16}{25}$

$\omega_{0} = 2\pi f_{0} = \frac{5G_{1}}{4C},$

wherein f₀ is selectable among five different values 0.982, 1.237, 1.557, 1.961, and 2.470 GHz by appropriate switching of the capacitors within the capacitor bank. These settings enable broadband operation over the approximate frequency range 0.7-2.218 GHz with generally less than 1-dB gain difference between the I and Q components.
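The stated frequency coverage can be checked against the relations given above with the following sketch. It assumes an ideal gyrator with parasitics ignored, so that |V₄/V₁| = G₁/(ωC), which with ω₀ = 5G₁/(4C) equals 0.8·f₀/f. The function names are hypothetical; the f₀ values are those listed above.

```python
F0_GHZ = [0.982, 1.237, 1.557, 1.961, 2.470]   # selectable f0 values from the text

def iq_gain_ratio(f_ghz, f0_ghz):
    """|V4/V1| = G1/(omega*C); with omega0 = 5*G1/(4*C) this equals 0.8*f0/f."""
    return 0.8 * f0_ghz / f_ghz

def band_within(db_limit, f0_ghz):
    """Frequency range (GHz) over which the I/Q gain difference is within db_limit dB."""
    k = 10.0 ** (db_limit / 20.0)
    return 0.8 * f0_ghz / k, 0.8 * f0_ghz * k

# One band per capacitor-bank setting, each allowing at most 1 dB of I/Q mismatch.
bands = [band_within(1.0, f0) for f0 in F0_GHZ]
lowest, highest = bands[0][0], bands[-1][1]
```

Under these assumptions the five 1-dB bands abut one another and together span roughly 0.70 GHz to 2.22 GHz, in agreement with the range stated above; the successive f₀ values are spaced by about 2 dB in frequency ratio.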

A transistor implementation of the input stage block 601 in FIG. 6C is shown in FIG. 6E. In this circuit, transistors M1 and M2 comprise a differential pair, and transistors M3 and M4 comprise load devices. Transistors M6 and M7 have shorted drain and source terminals, and are disposed at the nodes corresponding to output voltages V1 and V2, respectively. It is seen that transistors M6 and M7 are configured as MOS capacitors (MOSCAP's). When sized appropriately, capacitors M6 and M7 can help neutralize the gate-drain capacitances of transistors M1 and M2, helping to mitigate bandwidth degradation incurred by Miller multiplication.

In an embodiment, the gate areas of M6 and M7 may be chosen to be nominally 15% larger than those of M1 and M2 to account for second order gate overlap and other phenomena associated with transistor gate-drain capacitances. In general, preferred W/L ratios for the transistors will be within a range of 4 to 100, and preferably within a range of 4 to 20.

FIG. 6F shows a possible implementation of one of the transconductors G₁ or G₂ in the differential gyrator 604 shown in FIG. 6C. This implementation is appropriate if common-mode signal components at the input port are negligible. Note the input stage can be a simple differential pair. In a preferred embodiment, the output resistance of the transconductor can be boosted to better approximate an ideal current source by using the circuit shown in FIG. 6G.

The circuit in FIG. 6G incorporates a negative resistance block 610 in shunt between the output nodes 611 and 612. This block 610 presents an impedance R₁₂ between nodes 611 and 612 expressed as:

${\frac{1}{R_{12}} = {\frac{1}{2r_{oa}} + \frac{1}{r_{oc}} - \frac{g_{Ma}}{2}}},$

where r_(oa) and r_(oc) represent the small-signal drain-source channel resistances of transistors Ma and Mc, respectively (assuming Ma and Mb are matched and have identical output resistances), and g_(Ma) is the transconductance of transistor Ma. The negative resistance of the block 610 is adjustable via the control voltage Vc. The negative resistance block 610 overall acts to increase the possibly small channel resistances of transistors M1 and M2 shown in FIG. 6G. Because the outputs of the transconductor blocks function effectively as current sources, the output resistance should be made large, preferably on the order of at least 5,000 Ohms.
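The effect of the negative resistance block can be illustrated with the following sketch. The device values are hypothetical, and the sketch assumes that block 610 simply appears in parallel with the resistance it is intended to boost.

```python
def shunt_resistance_r12(r_oa, r_oc, g_ma):
    """Resistance R12 of block 610 from 1/R12 = 1/(2*r_oa) + 1/r_oc - g_ma/2."""
    conductance = 1.0 / (2.0 * r_oa) + 1.0 / r_oc - g_ma / 2.0
    if conductance == 0.0:
        return float("inf")
    return 1.0 / conductance

def parallel(r_a, r_b):
    """Parallel combination, valid for negative resistances as well."""
    g = 1.0 / r_a + 1.0 / r_b
    return float("inf") if g == 0.0 else 1.0 / g

# Hypothetical device values (ohms and siemens), chosen only for illustration.
r12 = shunt_resistance_r12(r_oa=10e3, r_oc=10e3, g_ma=1.0e-3)
boosted = parallel(1.0e3, r12)   # a 1 kOhm output resistance shunted by block 610
```

With these values R₁₂ is negative, and the net output resistance rises above the unshunted 1 kΩ; adjusting g_(Ma) via the control voltage would move the net resistance further toward the desired range.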

FIG. 6H shows yet another possible embodiment of a quadrature-phase generator known as an “injection-locked quadrature generator,” which is suitable for high-frequency operation. In this embodiment, the two differential pairs, M3-M4 and M5-M6, in conjunction with the resonant circuits comprised of inductances L, capacitances C, and resistances R, form low quality factor negative resistance oscillators. The resonant circuits are tuned to half of the frequency of the applied differential signal, V_(s). This signal establishes sinusoidal tail currents flowing through M1 and M2, where the tail current of M2 is 180 degrees out of phase with that of M1. The high impedance at the drains of M1 and M2 establishes virtual signal grounds at the source terminals of each of the two differential pairs. The inductances, L_(ss), are used to establish 50-Ohm input terminations for V_(s) at the frequency of V_(s). Because the output signals, V_(I) and V_(Q), are referenced to ground, and hence to the aforementioned virtual grounds, they represent gate-source voltages of M3-M4 and M5-M6. The gate-source voltage, however, is a square root function of the drain current. Since the currents in the drains of M5-M6 are 180 degrees phase displaced from those of M3-M4, V_(I) is a sinusoid at half the frequency of V_(s), while V_(Q) is likewise a sinusoid at half the frequency of V_(s), but 90 degrees out of phase with V_(I).

Thus to generate I and Q versions of a signal V_(input) using the above scheme, V_(input) may be first squared using a multiplier, and the squared signal supplied to the circuit in FIG. 6H as V_(s).

One of ordinary skill in the art will appreciate that the above method of quadrature generation need not be implemented using identical components as disclosed in FIG. 6H. In general, quadrature generation may be effected by simply squaring an input signal, providing positive and negative versions of the squared signal, and separately applying a square root function to each of the positive and negative versions of the squared signal. The resultant two square-rooted signals will then necessarily have a quadrature relationship.

One of ordinary skill in the art will appreciate that various implementations of a quadrature generator are possible other than those disclosed herein with respect to FIGS. 6A-6H. The disclosed implementations are not meant to limit the scope of the pre-distortion apparatus.

Variable-gain Amplifier (VGA)

FIG. 8 shows a preferred implementation of the RF variable-gain amplifiers (VGA) 405.1 and 405.2 in FIG. 4 using transconductors, i.e., circuits that convert voltage signals into current signals. The differential input signal 810 can be the I signal 401 a or Q signal 401 b shown in FIG. 4. The RF VGA comprises an input signal 810, an output signal 811, and a plurality of control signals G₁ Control 812, G₂ Control 813, G₃ Control 814, and G₄ Control 815. The VGA further comprises capacitors 816, 817, 818, and 819. By adjusting the control signals 812-815 and capacitances of capacitors 816-819, the gain, center frequency, 3-dB bandwidth, and quality factor of the transfer function between the input signal 810 and the output signal 811 can all be independently adjusted. The transfer function of the VGA shown in FIG. 8 can be expressed as:

$\begin{aligned} H(s) = \frac{V_{o}}{V_{i}} &= \frac{H(j\omega_{0})\left(\frac{s}{Q\omega_{0}}\right)}{1 + \frac{s}{Q\omega_{0}} + \left(\frac{s}{\omega_{0}}\right)^{2}} \\ &= \frac{s\left(\frac{G_{4}C_{x}}{G_{1}G_{2}}\right)}{1 + s\left(\frac{G_{3}C_{x}}{G_{1}G_{2}}\right) + s^{2}\left(\frac{C_{x}C_{y}}{G_{1}G_{2}}\right)} \end{aligned}$

In these expressions, ω₀ represents the tuned center frequency in radians per second, H(jω₀) represents the amplifier gain at the tuned center frequency ω₀, and Q represents the quality factor of the bandpass transfer characteristic. From the above transfer function, the tunable parameters of the VGA are seen to be:

$\omega_{0} = 2\pi f_{0} = \sqrt{\frac{G_{1}G_{2}}{C_{x}C_{y}}}$

$H(j\omega_{0}) = \frac{G_{4}}{G_{3}}$

$B = \frac{\omega_{0}}{Q} = \frac{G_{3}}{C_{y}} \quad \text{(3-dB bandwidth)}$

$Q = \frac{\sqrt{G_{1}G_{2}}}{G_{3}}\sqrt{\frac{C_{y}}{C_{x}}}$

Each of these parameters may thus be set by appropriately choosing the control signals 812-815 and capacitances of capacitors 816-819. One of ordinary skill in the art will realize that fewer or more transconductors may be provided than shown in FIG. 8, along with associated capacitances, to afford fewer or more degrees of freedom in choosing the design parameters. For example, an additional transconductor with a configurable gain may be disposed in series between G₁ and G₂ shown in FIG. 8.
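The design relationships above can be exercised with a short numerical sketch. The function names `vga_parameters` and `vga_response` are hypothetical, and the transconductance and capacitance values are arbitrary illustrative numbers, not values taken from any embodiment. The sketch evaluates the tunable parameters and then checks them against the transfer function evaluated at s = jω₀.

```python
import math

def vga_parameters(g1, g2, g3, g4, cx, cy):
    """Center frequency (rad/s), center gain, 3-dB bandwidth (rad/s), and Q."""
    w0 = math.sqrt(g1 * g2 / (cx * cy))
    gain = g4 / g3
    bandwidth = g3 / cy
    q = math.sqrt(g1 * g2) / g3 * math.sqrt(cy / cx)
    return w0, gain, bandwidth, q

def vga_response(s, g1, g2, g3, g4, cx, cy):
    """Bandpass transfer function H(s) in terms of G1..G4, Cx, and Cy."""
    d = g1 * g2
    return s * (g4 * cx / d) / (1 + s * (g3 * cx / d) + s * s * (cx * cy / d))

# Hypothetical values: transconductances in siemens, capacitances in farads.
g1, g2, g3, g4 = 10e-3, 10e-3, 1e-3, 4e-3
cx, cy = 1e-12, 1e-12

w0, gain, bw, q = vga_parameters(g1, g2, g3, g4, cx, cy)
h_at_w0 = vga_response(1j * w0, g1, g2, g3, g4, cx, cy)

# Doubling G4 alone should change only the center-frequency gain.
w0_b, gain_b, bw_b, q_b = vga_parameters(g1, g2, g3, 2 * g4, cx, cy)
```

With these illustrative values the center frequency is roughly 1.6 GHz, Q is 10, and the gain at the center frequency is 4. Changing G₄ alone rescales the gain without moving ω₀, B, or Q, which is one instance of the independent adjustment described above.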

FIG. 8A shows an alternative capacitor arrangement for the VGA shown in FIG. 8. Parasitic capacitances 822 and 823 may be incorporated into the values of the overall capacitances at nodes 820 and 821. Note that two shunt capacitors 824 and 825, each of capacitance C/2, may be used rather than one capacitor of capacitance C in order to keep the signal paths balanced, since in general any monolithic capacitance may be accompanied by an unavoidable parasitic capacitance at one (but generally not both) of its terminals.

FIG. 8B shows a circuit implementation of the transconductors G1, G2, G3 and G4. This circuit accepts a differential input signal comprising the signals V₁ 801 and V₂ 802. The gain of the transistors M3 and M4 can be varied based on an input signal V_(Q) 805. The differential output signal of the circuit comprises the difference between the currents I_(d1) and I_(d2). In the circuit shown, by cross-coupling the drain connections of M5 with M2, and by cross-coupling the drain connections of M6 with M1, and while sinking the drain currents of all four of these devices through a common, constant current sink, I_(ss), large-signal linearity between the differential current response, I_(d1)-I_(d2), and the differential input signal, V₁-V₂, is achieved. Moreover, the topology renders the constant of proportionality between the differential output current and the differential input voltage itself linearly proportional to the indicated control voltage, V_(Q). Note that the current source I_(ss) should provide a relatively constant current, with ideally very high output resistance.

Note that in a preferred implementation, transistors M1, M2, M5, and M6 are matched.

The relationships of the signals in the circuit are given as:

$I_{d1} - I_{d2} = G_{m}\left(V_{1} - V_{2}\right)$

$G_{m} = K_{n}\left(\frac{W}{L}\right)V_{Q},$

where K_(n) is the NMOS transconductance density parameter μ_(n)C_(ox), and W and L are the width and length, respectively, of the channel areas of transistors M₁ and M₂.

FIG. 8C shows an alternative circuit implementation of the transconductors G1, G2, G3 and G4, utilizing both NMOS and PMOS transistors. This circuit utilizes complementary field effect transistor (COMFET) technology, as indicated by the topology of transistors M1 a, M2, and M1 b. COMFET technology offers a decreased effective threshold voltage for operation from low-voltage power supplies, and is described in detail in D. Johns and K. Martin, Analog Integrated Circuit Design, John Wiley & Sons, Inc. (1997). The input voltages V1 and V2 can be expressed in terms of a common-mode voltage Vcm and a differential voltage Vdi as: V1=Vcm+Vdi/2; V2=Vcm−Vdi/2.

The large-signal output currents Id1 and Id2 can be expressed as:

$I_{d1} = \frac{K_{ne}}{2}\left(V_{1} - V_{Q} - V_{h}\right)^{2}$

$I_{d2} = \frac{K_{ne}}{2}\left(V_{2} - V_{Q} - V_{h}\right)^{2}$

where K_(ne) is the effective K_(n)W/L transconductance density of the COMFETs formed by the interconnection of NMOS and PMOS transistors, and V_(h) is the effective and invariably diminished threshold voltage offered by the COMFET interconnection. From these large-signal expressions, the small-signal differential output current can be derived as: I_(d1) − I_(d2) = K_(ne)(V_(cm) − V_(Q) − V_(h)) V_(di)
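For reference, the intermediate steps of this derivation can be written out using only the square-law currents and the definitions V1 = Vcm + Vdi/2 and V2 = Vcm − Vdi/2 given above. The difference of two squares factors directly, so no further approximation appears to be needed as long as the square-law expressions themselves hold for both devices:

$\begin{aligned} I_{d1} - I_{d2} &= \frac{K_{ne}}{2}\left[\left(V_{1} - V_{Q} - V_{h}\right)^{2} - \left(V_{2} - V_{Q} - V_{h}\right)^{2}\right] \\ &= \frac{K_{ne}}{2}\left(V_{1} - V_{2}\right)\left(V_{1} + V_{2} - 2V_{Q} - 2V_{h}\right) \\ &= \frac{K_{ne}}{2}\,V_{di}\cdot 2\left(V_{cm} - V_{Q} - V_{h}\right) \\ &= K_{ne}\left(V_{cm} - V_{Q} - V_{h}\right)V_{di} \end{aligned}$

The transconductance K_(ne)(V_(cm) − V_(Q) − V_(h)) is thus linear in the control voltage V_(Q) for a fixed common-mode voltage.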

One of ordinary skill in the art will recognize that various implementations of variable gain amplifiers are known in the art, and may be substituted for the embodiments shown in FIGS. 8, 8A, 8B, and 8C. The disclosed implementations are not meant to limit the scope of the pre-distortion apparatus.

Error Signal Generator

FIG. 9 shows an implementation of the error signal generator block 303 shown in FIG. 4. In FIG. 9, two RF signals 901 and 902 can be input to single-to-differential ended converters 903 and 904. The converter 903 can output a differential signal 903 a, while the converter 904 can output a differential signal 904 a. In an embodiment of the pre-distortion apparatus, the signal 901 can be the buffered reference signal 412 a shown in FIG. 4, while signal 902 can be the buffered feedback signal 415 a, also shown in FIG. 4.

To allow the signals 901 and 902 to be compared on an equal footing, AGC's 905 and 906 can be provided to adjust the amplitudes of the differential signals 903 a and 904 a, while the delay-locked loop (DLL) 907 can be provided to adjust the delays of the differential signals. In conjunction with the coarse amplitude adjustment of the scale block 105 in FIG. 4, the AGC 906 can serve to adjust for any gains introduced to the datapath signal 115 a before arriving at the error signal generator 303 as the feedback signal 415 a, including the power gain introduced by the power amplifier 107. Similarly, the AGC 905 can adjust for any gain introduced to the reference signal 412 a. Each automatic gain control circuit 905 or 906 can accept a bandgap voltage reference signal 910 as an input control signal, and can be coupled to a filtering capacitor 911 or 912 for setting the bandwidth of the AGC. In a preferred embodiment, the capacitor can be chosen such that the bandwidth of the AGC is 200 MHz.

The output signals 905 a and 906 a of the AGC's 905 and 906 may be input to the DLL 907. The DLL 907, in conjunction with the coarse delay block 116 in FIG. 1, can serve to synchronize the reference signal 901 with the feedback signal 902 by adjusting for any difference in delays experienced by the signals, including the delay of the power amplifier 107. The DLL output signals 907 a and 907 b may then be input to a differencing amplifier 908, which can generate an error signal e(t) 908 a that is a function of the difference between the two signals 907 a and 907 b. In a preferred embodiment of the differencing amplifier, the amplifier can be a saturating difference amplifier, i.e., the output signal of the amplifier can saturate at a maximum voltage level when the difference between the input signals exceeds a certain voltage, and likewise, the output signal of the amplifier can saturate at a minimum voltage level when the difference between the input signals is below a certain voltage.
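The amplitude and delay equalization performed ahead of the differencing amplifier can be illustrated with a discrete-time behavioral sketch. This is not a model of the AGC or DLL circuits: it assumes sampled signals, stands in for each AGC with a simple RMS normalization, and stands in for the DLL with an exhaustive least-squares search over integer sample lags. All function names, the test tone, the gain of 20, and the 7-sample delay are hypothetical choices made for illustration.

```python
import math

def rms(x):
    """Root-mean-square level of a sampled signal."""
    return math.sqrt(sum(v * v for v in x) / len(x))

def normalize(x, target_rms=1.0):
    """Stand-in for an AGC: scale the signal to a target RMS level."""
    scale = target_rms / rms(x)
    return [v * scale for v in x]

def best_lag(ref, fb, max_lag):
    """Stand-in for the DLL: lag at which fb, advanced by lag, best matches ref."""
    n_out = len(ref) - max_lag
    def mismatch(lag):
        return sum((ref[n] - fb[n + lag]) ** 2 for n in range(n_out))
    return min(range(max_lag + 1), key=mismatch)

def aligned_difference(ref, fb, max_lag):
    """Equalize amplitude and delay, then return (lag, sample-wise difference)."""
    ref_n, fb_n = normalize(ref), normalize(fb)
    lag = best_lag(ref_n, fb_n, max_lag)
    n_out = len(ref) - max_lag
    return lag, [ref_n[n] - fb_n[n + lag] for n in range(n_out)]

def tone(n):
    """Hypothetical amplitude-modulated test tone."""
    return math.sin(2 * math.pi * n / 50) * (1 + 0.5 * math.sin(2 * math.pi * n / 400))

N = 800
reference = [tone(n) for n in range(N)]
feedback = [20.0 * tone(n - 7) for n in range(N)]  # amplified, delayed copy

lag, diff = aligned_difference(reference, feedback, max_lag=20)
```

For a feedback signal that is only a scaled and delayed copy of the reference, the residual difference is essentially zero once both corrections are applied, so any remaining error would reflect distortion rather than gain or delay mismatch.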

Various embodiments of a saturating difference amplifier are possible. One embodiment is an amplifier outputting a function of the difference such as tanh [T·diff], where tanh is the hyperbolic tangent function, T is a chosen gain parameter, and diff is the difference between the input signals 907 a and 907 b. Such a function may have the advantage of providing an appropriately large error gain T for small differences (diff) to overcome possible offsets in the amplifier, while still limiting (saturating) the gain for large differences to avoid adversely impacting the convergence of the adaptive algorithm performed by the Adapt P block 403.1 or Adapt Q block 403.2 in FIG. 4. In a preferred embodiment, the gain T may range from 30 to 50. The output signal 908 a may saturate at plus or minus 1 V. One of ordinary skill in the art will recognize that other implementations of saturating difference amplifiers are possible, including one wherein the output signal comprises a rising linear characteristic that saturates for large enough input signal differences.
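The tanh characteristic described above can be sketched in a few lines. The function name `saturating_error` is hypothetical, the gain of 40 is simply one value within the stated range of 30 to 50, and scaling the tanh output by a saturation level `v_sat` of 1 V is an assumption made to match the stated output limits.

```python
import math

def saturating_error(ref, fb, gain=40.0, v_sat=1.0):
    """Error signal v_sat * tanh(gain * (ref - fb)), in volts."""
    return v_sat * math.tanh(gain * (ref - fb))

# Small difference: response is nearly linear, about gain * diff.
small = saturating_error(0.001, 0.0)

# Large differences of either sign: output is pinned near the saturation level.
large_pos = saturating_error(0.5, 0.0)
large_neg = saturating_error(0.0, 0.5)
```

For a 1 mV difference the output stays close to the linear value of 40 mV, while a 0.5 V difference of either sign drives the output to the plus or minus 1 V limit.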

The descriptions above are not intended to be exhaustive or to limit the invention to the precise form disclosed. It should be understood that the invention can be practiced with modification and alteration and that the invention be limited only by the claims and the equivalents thereof. 

1. A pre-distortion apparatus comprising: a datapath for carrying a datapath signal, a source of a reference signal, and a feedback path for carrying a feedback signal; an error signal generator comprising a difference amplifier, wherein the input signals to said difference amplifier comprise: 1) a first amplifier input signal derived from the reference signal, and 2) a second amplifier input signal derived from the feedback signal, and wherein the output signal of said amplifier comprises an error signal; an adaptive block comprising: an analysis basis function generator for generating a plurality of analysis basis functions; a plurality of correlators for correlating the error signal with each of said plurality of analysis basis functions, the output signals of the plurality of correlators comprising a plurality of correlation coefficients; a synthesis block for generating a plurality of synthesis work functions and for generating a weighted sum of said plurality of synthesis work functions, wherein each synthesis work function is weighted by a corresponding one of said plurality of correlation coefficients; a variable gain amplifier (VGA) for multiplying said datapath signal with said weighted sum of said plurality of synthesis work functions.
 2. The apparatus of claim 1, further comprising: a datapath envelope generator for generating a datapath envelope signal of the datapath signal; a datapath power generator for generating a plurality of raised powers of the datapath envelope signal; wherein: each of said plurality of synthesis work functions comprises a linear combination of said raised powers of the datapath envelope signal.
 3. The apparatus of claim 1, further comprising a pre-distorted output signal, wherein: the datapath signal, reference signal, and feedback signal are complex signals, and the pre-distorted output signal is a real signal comprising the real portion of the output signal of said VGA.
 4. The apparatus of claim 1, wherein the datapath signal, reference signal, and feedback signal are real signals.
 5. The apparatus of claim 4, further comprising a datapath phase generator for generating in-phase (I) and quadrature-phase (Q) components of the datapath signal, and a reference phase generator for generating in-phase (I) and quadrature-phase (Q) components of the reference signal, wherein: each of said plurality of analysis basis functions comprises an I analysis function and a Q analysis function; each of said plurality of correlation coefficients comprises an I coefficient generated by correlating the error signal with one of said plurality of I analysis functions, and a Q coefficient generated by correlating the error signal with one of said plurality of Q analysis functions; each of said plurality of synthesis work functions comprises an I synthesis function and a Q synthesis function; each of said plurality of weighted synthesis work functions comprises: a weighted I synthesis function comprising one of said plurality of I synthesis functions multiplied by one of said plurality of I coefficients; and a weighted Q synthesis function comprising one of said plurality of Q synthesis functions multiplied by one of said plurality of Q coefficients; said VGA comprises an I VGA for multiplying said I component of the datapath signal with said weighted sum of I synthesis functions, and a Q VGA for multiplying said Q component of the datapath signal with said weighted sum of Q synthesis functions; said apparatus further comprising: a summer for summing the output signal of the I VGA with the output signal of the Q VGA; and a pre-distorted output signal comprising the output signal of the summer.
 6. The apparatus of claim 5, further comprising: a datapath envelope detector for generating a datapath envelope signal of the datapath signal; a datapath power generator for generating a plurality of raised powers of the datapath envelope signal; wherein: each of said plurality of I synthesis work functions comprises a linear combination of said raised powers of the datapath envelope signal, and each of said plurality of Q synthesis work functions comprises a linear combination of said raised powers of the datapath envelope signal.
 7. The apparatus of claim 2, wherein the datapath signal is a signal modulated on a carrier signal.
 8. The apparatus of claim 2, wherein the error signal saturates at a first voltage level when the first amplifier input signal is greater than the second amplifier input signal by a first pre-determined threshold, and the error signal saturates at a second voltage level when the second amplifier input signal is greater than the first amplifier input signal by a second pre-determined threshold.
 9. The apparatus of claim 8, wherein the error signal varies substantially linearly with the difference between the first input signal and the second input signal, when the absolute difference between the first amplifier input signal and the second amplifier input signal is less than a pre-determined value.
 10. The apparatus of claim 2, wherein the error signal is derived from a hyperbolic tangent (tanh) function of a pre-determined gain (T) times the difference between the first input signal and the second input signal.
 11. The apparatus of claim 2, wherein each of said plurality of analysis work functions consists of a raised power of said reference envelope signal.
 12. The apparatus of claim 2, wherein each of said plurality of analysis work functions is orthogonal to every other of said plurality of analysis work functions.
 13. The apparatus of claim 2, wherein the plurality of work functions are chosen according to a Cholesky method for generating orthogonal functions.
 14. The apparatus of claim 2, wherein the plurality of work functions are chosen to have less than a maximum eigenvalue spread.
 15. The apparatus of claim 2, wherein the adaptive block is configurable such that the correlation coefficients are held constant in response to a freeze-adapt signal.
 16. The apparatus of claim 2, wherein the adaptive block is configurable such that said weighted sum of said plurality of synthesis work functions is held constant in response to a freeze-adapt signal.
 17. The apparatus of claim 15, wherein the freeze-adapt signal is applied when the power of the pre-distorted signal exceeds a pre-determined threshold.
 18. The apparatus of claim 2, wherein the error signal generator further comprises a delay-locked loop (DLL), and wherein: the input signals to the DLL comprise: 1) a first DLL input signal derived from the reference signal, and 2) a second DLL input signal derived from the feedback signal; the output signals of the DLL comprise: 1) the first amplifier input signal derived from the reference signal, and 2) the second amplifier input signal derived from the feedback signal; and the DLL adjusts the relative delay between the first amplifier input signal and the second amplifier input signal.
 19. The apparatus of claim 18, wherein the DLL aligns the first amplifier input signal with the second amplifier input signal.
 20. The apparatus of claim 18, further comprising: a first automatic gain control circuit (AGC), wherein the input signal of the first AGC comprises a signal derived from the reference signal, and the output signal of the first AGC comprises the first DLL input signal; a second automatic gain control circuit (AGC), wherein the input signal of the second AGC comprises a signal derived from the feedback signal, and the output signal of the second AGC comprises the second DLL input signal.
 21. The apparatus of claim 2, wherein the datapath signal, reference signal, feedback signal, and pre-distorted signal are complex signals; the correlation coefficients are complex coefficients; the synthesis work functions and the analysis work functions are complex functions; and the VGA performs a complex multiplication.
 22. The apparatus of claim 5, further comprising: a power amplifier for amplifying the pre-distorted output signal, the output signal of the power amplifier comprising a power amplifier (PA) output signal; an attenuator for attenuating the PA output signal, the output signal of said attenuator comprising said feedback signal; and a delay block for delaying the datapath signal, the output signal of the delay block comprising said reference signal.
 23. A pre-distortion apparatus comprising: a datapath signal, a reference signal, and a feedback signal; an error signal generator means for generating an error signal from input signals comprising: 1) a first input signal derived from the reference signal, and 2) a second input signal derived from the feedback signal; an adaptive block means for generating a plurality of correlation coefficients between a plurality of analysis basis functions and said error signal; a synthesis block means for multiplying each of a plurality of synthesis work functions with one of said plurality of correlation coefficients, and for summing the products of such multiplications to generate a weighted sum; a variable gain amplifier VGA for multiplying said datapath signal with said weighted sum.
 24. The apparatus of claim 23, further comprising a pre-distorted output signal, wherein: the datapath signal, reference signal, and feedback signal are complex signals, and the pre-distorted output signal is a real signal comprising the real portion of the output signal of said VGA.
 25. The apparatus of claim 23, wherein the datapath signal is a signal modulated on a carrier signal.
 26. The apparatus of claim 25, further comprising: a power amplifier means for amplifying the pre-distorted output signal, the output signal of the power amplifier means comprising a power amplifier output signal; an attenuator means for attenuating the PA output signal, the output signal of said attenuator means comprising said feedback signal; and a delay block means for delaying the datapath signal, the output signal of the delay block means comprising said reference signal. 